首页> 外文OA文献 >Unsupervised Visual Sense Disambiguation for Verbs using Multimodal Embeddings

【2h】

Unsupervised Visual Sense Disambiguation for Verbs using Multimodal Embeddings

机译：使用多模嵌入的动词无监督视觉消歧

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

We introduce a new task, visual sense disambiguation for verbs: given an image and a verb, assign the correct sense of the verb, i.e., the one that describes the action depicted in the image. Just as textual word sense disambiguation is useful for a wide range of NLP tasks, visual sense disambiguation can be useful for multimodal tasks such as image retrieval, image description, and text illustration. We introduce VerSe, a new data set that augments existing multimodal data sets (COCO and TUHOI) with sense labels. We propose an unsupervised algorithm based on Lesk which performs visual sense disambiguation using textual, visual, or multimodal embeddings. We find that textual embeddings perform well when gold standard textual annotations (object labels and image descriptions) are available, while multimodal embeddings perform well on unannotated images. We also verify our findings by using the textual and multimodal embeddings as features in a supervised setting and analyse the performance of visual sense disambiguation task. VerSe is made publicly available and can be downloaded at: https://github.com/spandanagella/verse.

机译：我们引入了一项新任务，即对动词进行视觉歧义消除：给定一个图像和一个动词，为动词分配正确的意义，即描述图像中描述的动作的正确意义。正如文本意义上的歧义消除可用于多种NLP任务一样，视觉意义上的歧义消除可用于多模式任务，例如图像检索，图像描述和文本插图。我们介绍了VerSe，这是一个新的数据集，它使用感知标签增强了现有的多模式数据集（COCO和TUHOI）。我们提出了一种基于Lesk的无监督算法，该算法使用文本，视觉或多模式嵌入来执行视觉歧义消除。我们发现，当有黄金标准的文本注释（对象标签和图像描述）可用时，文本嵌入效果很好，而多模式嵌入在未注释的图像上表现良好。我们还通过使用文本和多模式嵌入作为监督环境中的功能来验证我们的发现，并分析视觉消除歧义任务的性能。 VerSe公开提供，可以从以下网址下载：https：//github.com/spandanagella/verse。

著录项

作者
Gella, Spandana; Lapata, Maria; Keller, Frank;
展开▼
作者单位

展开▼
年度 2016
总页数
原文格式 PDF
正文语种 eng
中图分类

相似文献

外文文献
中文文献
专利

1. Improving English verb sense disambiguation performance with linguistically motivated features and clear sense distinction boundaries [J] . Jinying Chen, Martha S. Palmer Computers and the Humanities . 2009,第2期

机译：借助语言动机特征和清晰的语言区分界限，提高英语动词的歧义消除性能
2. Processing of Arabic Diacritical Marks: Phonological-Syntactic Disambiguation of Homographic Verbs and Visual Crowding Effects [J] . Hermena Ehab W., Drieghe Denis, Hellmuth Sam, Journal of experimental psychology. human perception and performance . 2015,第2期

机译：阿拉伯变音符号的处理：谐音动词的音韵句法歧义消除和视觉拥挤效应
3. Exploratory Study of Word Sense Disambiguation Methods for Verbs in Brazilian Portuguese [J] . MARCO ANTONIO SOBREVILLA CABEZUDO, THIAGO ALEXANDRE SALGUEIRO PARDO International journal of computational linguistics and applications . 2015,第1期

机译：巴西葡萄牙语动词词义消歧方法的探索性研究
4. Unsupervised Visual Sense Disambiguation for Verbs using Multimodal Embeddings [C] . Spandana Gella, Mirella Lapata, Frank Keller Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies . 2016

机译：使用多模态嵌入的动词无监督视觉歧义消除
5. Translating the Italian experience: An analysis of verbs of cognition and perception to support sense disambiguation in machine translation [D] . Vanni, Michelle. 2000

机译：翻译意大利经验：对认知和感知动词的分析，以支持机器翻译中的歧义消除
6. Research and applications: Word sense disambiguation in the clinical domain: a comparison of knowledge-rich and knowledge-poor unsupervised methods [O] . Rachel Chasin, Anna Rumshisky, Ozlem Uzuner, 2014

机译：研究与应用：临床领域中的单词歧义消除：知识丰富和知识匮乏的无监督方法的比较
7. Unsupervised Visual Sense Disambiguation for Verbs using Multimodal Embeddings [O] . Gella, Spandana, Lapata, Mirella, Keller, Frank 2016

机译：使用多模态的动词无监督视觉消歧的嵌入
8. Searching Semantic Resources for Complex Selectional Restrictions to Support Verb Sense Disambiguation. [R] . Taylor, M., Carlson, L., Poisson, S., 2010

机译：搜索语义资源的复杂选择限制以支持动词意义消歧。

Unsupervised Visual Sense Disambiguation for Verbs using Multimodal Embeddings

摘要

著录项

相似文献

相关主题

期刊订阅